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METHOD OF JOINT DECODING OF POSSIBLY MUTILATED CODE WORDS 

BACKGROUND OF THE INVENTION 
Field Of The Invention 

[0001] The invention relates to a method of decoding possibly 
mutilated code words of a code, wherein an information word and an 
5 address word are encoded into a code word of said code using a 

generator matrix^ and wherein said address words are selected such 
that address words having a known relationship are assigned to 
consecutive code words. The invention relates further to a 
corresponding apparatus for decoding possibly mutilated code words 
10 and to a computer program for implementing said method. 

Description Of The Related Art 

[0002] In European patent Patent application Application No. 
01201841.2 . corresponding to U.S. Patent Application Publication 

15 No. 2003/0066014-A1 (PHNL 010331) , the concept of coding for 

informed decoding is described. An enhancement can be found in 
European patent Patent application A pplication No. 0 1-203-147 . 2 j_ 
corresponding to U.S. Patent Application P ublication No. 
2003/0095056-A1 (PH-NL 010600) . The key element of the inventions 

20 described in oaid European patont these applications is an 

appropriate selection of the mapping of information strings towards 
code words from the Error Correcting Code (ECC) that is applied. 
The aim of coding for informed decoders is to enable more reliable 
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information retrieval if the decoder a priori knows part of the 
encoded information. A typical example is in the field of address 
retrieval of optical media. In case of a forced jump to a certain 
sector, part of the address of the sector in which the read/write 
5 head will land is known taking the jump accuracy into account. For 
instance^ in DVR (Digital Video Recording)^ it is anticipated that 
on the spiral track on the optical disc, a series of messages is 
encoded which aids in the logical positioning of the read/write 
head above the disc, which^ e.g.^ contains copyright information, 
10 date information and a logical track counter. Of this information, 
only the logical track counter changes in between successive 
messages. Therefore, a priori known information can be available 
from successful decoding of previous messages. 

[0003] According to the solutions described in the above above- 
15 mentioned European patent applications, decoding of a second 

message can only gain from the decoding of the first message, if 
said first decoding is successful. A second decoding can than then, 
in turn^ aid in the decoding of the third message^ and so on. 
However, if the decoding of the first message fails, the fact that 
20 the second message has been encoded in a special way does not 

improve the error correcting capabilities, i.e.j_ the second and any 
further decoding is not helped by previous decodings. All decodings 
could then fail. In other situations, for example^ at the very 
start of a recording or playback session, not much is known about 
25 the actual landing place of the read/write head. In that case, only 
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the error correcting capabilities of the code can be used for 
retrieving information. 

SUMMARY OF THE INVENTION 
5 [0004] It is therefore an object of the present invention to 
provide an improved method and apparatus of decoding possibly 
mutilated code words which can be used if the error correcting 
capabilities of the code are insufficient to allow reliable 
information retrievals and which can also be used if there are so 
10 many errors that, despite application of informed decoding as 
described in the above mentioned European patent applications, 
these errors can not be corrected. 

[0005] This object is achieved^ according to the present 
invention^ by a method of decoding as claimed in claim 1, 
15 comprising the steps of: 

-[0006] decoding the differences of a number of pairs of possibly 
mutilated code words to obtain estimates for the differences of the 
corresponding pairs of code words, 

-[0007] combining said estimates to obtain a number of at least 
20 two corrupted versions of a particular code word, 

-[0008] forming a code vector from said number of corrupted 
versions of said particular code word in each coordinate, 
-[0009] decoding said code vector to a decoded code word in said 
code , and 
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-[0010] using said generator matrix to obtain the information 
word and the address word embedded in said decoded code word. 

A corresponding apparatus for decoding according to the 

present invention is claimed in claim 11, A computer program 
5 comprising computer program code means for causing a computer to 
perform the steps of the method as claimed in claim 1 when that 
computer program is run on said computer is claimed in claim 12. 
Preferred embodiments of the invention arc defined in the dependent 
claims . 

10 [0011] The present invention is based on the idea to employ 

certain relationships between consecutive code words and to jointly 
decode several such consecutive code words. u Consecutive" code 
words in this context shall mean code words which are read and/or 
inputted into the decoder subsequently, e.g.j, code words which are 

15 located in series next to each other in a data stream^ or which are 
stored in consecutive sectors on an information carrier^ such as a 
CD, DVD or DVR or magnetic disc. 

[0012] The proposed method of joint decoding comprises two main 
elements. A first main element comprises the step of obtaining 

20 estimates for the differences of various pairs of consecutive code 
words by decoding the differences of their corrupted versions. 
According to a second main element^ the decoding results of said 
differences of corrupted pairs of consecutive code words are 
combined resulting in a number of corrupted versions of one and the 

25 same code word. These corrupted versions then are all used for 



PHNL020183-SS-RED- 031806 



4 



PHNL 020183 

obtaining the desired code word from which^, finally^ the 
information word and the address word can be retrieved. Since the 
address words encoded in the code words have a certain known 
relationship, much can be said during decoding about the difference 
5 of possibly mutilated code words which knowledge is advantageously 
used according to the invention during decoding. 
[0013] Preferred cmbodimcnto of the otcp of forming the code 
vector arc defined in claimo 2 to 1 . Preferably, said code vector 
is formed by majority voting in each coordinate from the number of 

10 corrupted versions obtained from the estimates. If more than one 

value occurs most frequent among said number of corrected versions^ 
the corresponding coordinate of said code vector is erased 
according to a further preferred embodiment. Alternatively or in 
addition, reliability information available on the symbols of one 

15 or more possibly mutilated code words is used for selecting the 
coordinates of said code vector according to still another 
preferred embodiment. Reliability information could be information 
about the probability that a certain value is correct. If 
available, it is preferred to include reliability information for 

20 each of the bits of the code vector for enhancing its decoding. 
[0014] A preferred way of obtaining an estimate for the 
difference of a pair of code words is to decode the difference of 
the corresponding pair of possibly mutilated code words to the 
closest code word from a subcode consisting of all possible 
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differences of two consecutive code words of the main code which 
closest code word is then used as said estimate. 
[0015] The probability of incorrect decoding can be further 
reduced by introducing an extra check according to which the 
5 obtained estimates are checked if they show a predetermined form 
and/or have a possible value. Preferably, if this check fails, the 
decoding result should be rejected. 

[0016] The present invention is advantageously used for decoding 
of code words stored on an information carrier, such as an optical 

10 or magnetic disc. According to standards used for optical 

recording, such as the CD-DA or the DVR standard, address words 
assigned to consecutive code words are consecutive, e.g.j_ are 
subsequently increased by one, and preferably represent the sector 
address of the sector in which the corresponding code word is 

15 stored. Said relationship between the address words assigned to 

consecutive code words will be exploited by the present invention. 
[0017] For reducing decoding delay, it is advantageous if the 
number of pairs of possibly mutilated code words to be decoded is 
as small as possible. However, at least two pairs of two 

20 consecutive possibly mutilated code words have to be decoded so 

that at least three estimates can be obtained^ since particularly 
majority voting on two alternatives is not useful. If reliability 
information is used instead of majority voting^ one pair might be 
sufficient. 
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[0018] According to another aspect of the present invention^ it 
is proposed that in said step of combining said estimates to obtain 
a number of corrupted versions of a particular code word^_ a first 
corrupted version corresponds to a first possibly mutilated code 
5 word, a second corrupted version corresponds to the difference 

between a second possibly mutilated code word and a first estimate, 
obtained by decoding the difference between said first and said 
second possibly mutilated code words, and a third corrupted version 
corresponds to the difference between a third possibly mutilated 
10 code word, said first estimate and a second estimate, obtained by 
decoding the difference between said second and said third possibly 
mutilated code words. 

[0019] According to still another aspect of the invention^ the 
proposed solution could also be used in combination with the 

15 solutions described in the above above- mentioned European p atent 
applications using a priori known information during decoding. A 
preferred way could be that, in a first step, the decoder tries to 
decode a possibly mutilated code word using a priori known 
information available on the address word embedded in said possibly 

20 mutilated code word. If this decoding fails or if the result is not 
reliable enough, then the present solution of joint decoding could 
be used. However, it is also possible that the present solution is 
always used in addition to the use of a priori known information 
during decoding. 

25 
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BRIEF DESCRIPTION OF THE DRAWINGS 
[0020] The invention will now be explained more in detail with 
reference to the drawings, in whichj_ 

[0021] Fig. 1 shows the format of a data word information to be 
5 encoded-^. 

[0022] Fig. 2 shows the format of a code word-j_ 

[0023] Fig. 3 shows a block diagram of an encoding and decoding 

s cheme— ; and 

[0024] Fig. 4 shows a block diagram of the method of decoding 
10 according to the present invention. 



DESCRIPTION OF THE PREFERRED EMBODIMENTS 
[0025] In the following, it is assumed that information is 
represented as a k-bits string called data word d as shown in Fig. 

15 1, said data word d comprising an address word a(i) and an 
information word m. The address word a(i) is a b-bits string 
representing the sector address for sector i; the information word 
m is a (k-b) -bits string containing any information to be stored, 
such as audio, video, software, copyright or date information or 

20 any other kind of data. It should be noted that the address word 

a(i) is known if i is known and vice versa; however, knowledge of i 
does not give information on the information word m. 
[0026] The data word d shown in Fig. 1 is encoded using a k x n 
binary generator matrix G so that the data word d=(a(i), m) is 
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mapped on the n-bits code word c(i, m)=(a(i), m)G as shown in Fig, 
2. 

[0027] Fig. 3 shows a block diagram of a typical system using 
encoding and decoding. Therein user data, e.g.^ audio or video 
5 data, coming from a data source 1, e.g.^_ recorded on a master tape 
or master disc, are encoded before they are stored on a data 
carrier, e.g.^ a disc, or transmitted over a transmission channel, 
e.g.^ over the internet Internet , before they are again decoded for 
forwarding them to a data sink 9, e.g.^_ for replaying them. 

10 [0028] The user data of the source 1 are first encoded by a 

source encoder 2, then error correction encoded by an ECC encoder 
3^ and thereafter modulated by a modulator 4, e.g.j_ an EFM 
modulator, before the encoded user data - the code words - are put 
on the channel 5 on which errors may be introduced into the code 

15 words. The term "channel" 5 ohall is here interpreted broadly, 

including and includes a transmission channel as well as storage of 
the encoded data on a data carrier for a later replay. 
[0029] When replay of data is intended, the encoded data first 
have to be demodulated by a demodulator 6, e.g.^_ an EFM 

20 demodulator, before they are error correction decoded by an ECC 
decoder 7 and source decoded by a source decoder 8. Finally, the 
decoded user data can be input to the sink 9, e.g.^ a player device 
for replay of the user data. 

[0030] The method of decoding according to the present invention 
25 shall be explained more in detail with reference to Fig. 4. It 
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shall be assumed that data stored in encoded form on the record 
carrier 10 shall be replayed. In a first step, an amount of data r 
are read by a reading unit 11 and forwarded to an encoding 
apparatus 12, During its way from the encoder to the decoder^ 
5 errors might be introduced into code words, e.g.^ by scratches on 
an optical record carrier or by transmission errors, so that the 
read code words r are possibly mutilated. Those errors shall be 
corrected by the decoder 12 . 

[0031] At first, the difference D of code words situated in 
10 consecutive sectors (or located consecutively in a transmitted data 

stream) is computed in unit 13. The difference D of code words 

and r^ + i in the sectors i and i+1 is computed as follows: 
c(i, nvjjecd+l, m 2 ) = (A(i), m 1 ©m 2 )G, 

wherein A (i) =a (i) ©a (i+1) for 0^i<2 b -2. The key observation is that 
15 for appropriate choices of the address word a, much can be said of 

the difference A(i) of two consecutive address words. It should be 

noted that 0 indicates a modulo -2 operation in which additions and 

subtractions have the same result. 

[0032] Assuming that corrupted versions r^, r 2 , r3 of L = 3 
20 consecutive code words c lt c 2 , c 3 are read^ one can write for j = 
1, 2, 3: 

rj = c(i+j-l, mj)©ej = (a(i), mj)G©ej, 

ej representing an error vector. It is clear that 

D 12 = r!©r 2 = (a(i), m^) G© (a (i+1) , m 2 )G©(e!©e 2 ) 

25 = (A(i), m 1 ©m 2 )G©(e 1 ©e 2 ) jL and 
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D 23 = r 2 ©r 3 = (A(i+1), m 2 ©m 3 )G©(e 2 ©e 3 ) . 

These L-l = 2 differences D12 and D23 are computed by unit 13 and 
inputted into the first decoding unit 14 . 

[0033] In the decoding unit 14^_ said differences D 12 , D 23 are 
5 decoded each to the closest code word from a subcode C" which 

consists of all possible differences of two consecutive code words 
c of the main code C. This could^ e.g.^_ be done by comparing the 
differences D 12 / D 23 with all possible code words of the subcode C" 
and by selecting the closest code word as estimate u for c(i, 
10 m 1 )©c(i+l / 1112) and as estimate v for c(i+l, m 2 )©c(i+2, m 3 ) . Thus, 
in the first decoding unit 14_^ estimates u, v for the differences 
of the pairs of code words C!©c 2 and C2©c 3j _ corresponding to the 
pairs of possibly mutilated code words r!©r 2 and r 2 ©r 3j _ are 
obtained. 

15 [0034] These estimates u # v are combined in unit 15 by computing 

w^: = ri = c(i, mi)©ei 

w 2 : = r 2 ©u= r 1 ©(r 1 ©r 2 ©u) 

w 3 : = r 3 ©u©v = r 1 ©(r 1 ©r 2 ©u)©(r 2 ©r 3 ©v) . 

If the estimate u is correct, then r!©r 2 ©u = e!©e2- Similarly, if 
20 v is correct, then r 2 ©r 3 ©v = e2©e 3 . Hence, if the estimates u and 
v both are correct, then 

wi = c(i, mi)©ei 

w 2 = c(i, m 1 )©e 2 

w 3 = c(i, mi)©e 3 . 
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In combining unit 15^ a number L = 3 of corrupted versions w lf w 2 , 
w 3 of the particular code word c^ = c(i, m^) are thus obtained. 
[0035] Next, in unit 16 ncxt ^ the code vector z is constructed 
by component -wise majority voting of the corrupted versions w 1# w 2/ 
5 W3 of the code word ci- That is # for each i e {l, 2, n} , the i- 
th component z± of the code vector z is an erasure, if vjh, w 2 i and 
W3i are distinct; otherwise, the component z^ equals the most 
frequent element among wn, w 2 i, w 3 i. The code vector z is then 
decoded by a second decoding unit 17 for the code that 

10 dccodco the second decoding unit 17 decoding the code vector z— 3r£- 
into a code word c" of said code C. Finally, in unit 18^ the 
generator matrix G, which had been used by the encoder to encode 
the address words and the information words into code words, is 
used to finally retrieve the information word m and the address 

15 word a embedded in said code word c". 

[0036] In general, a number of corrupted versions of L 
consecutive code words are read, say rj = c(i+j-l, mj)©ej for j = 
1, 2, L. Estimates for the differences of each of the (L-l) 
pairs of consecutive code words are obtained. By combining these 

20 estimates, L corrupted versions w 1# w 2 , w L of the code word c ± = 
c (i, mi) are obtained. If all the estimates are correct, then it 
holds wj = c(i, mi)©ej for j = 1, 2, L. The code vector z is 
obtained as the majority vote of w 1# w L in each of the 
coordinates. If^_ in a certain coordinate^ more than one symbol 

25 occurs most frequent, this coordinate in the code vector z is 
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erased. Finally, the code vector z is decoded to a code word in the 
code C. 

[0037] For reducing decoding delay, it is advantageous if the 
number L is as small as possible. With L = 2, the described method 
5 is not appropriate, as majority voting on two alternatives is not 
useful. If reliability information, also known as soft decision 
information, is available on the bits of the possibly mutilated 
code words r± and r2, reliability information for each of the bits 
of w 2 : = r 2 ®u can be obtained according to a well-known method. The 

10 code vector z can now, instead from majority voting, be obtained by 
replacing setting the coordinates z-^ to the most reliable of the 
bits from rn and W2i- For enhancing the decoding of the code 
vector reliability information could be included for each of the 
bits of the code vector z. The reliability information for bit i in 

15 z is obtained by combining the reliability information of the bits 
r!i and w 2 i- 

[0038] Next, two special cases shall be briefly discussed. In a 
first special case, it is considered that the information word a(i) 
is the conventional k-bits binary representation of the integer i. 

20 For example, if k = 8, then a (57) = 00111001, as 57 = 

0»2 7 +0*2 6 +l«2 5 +l*2 4 +l*2 3 +0*2 2 +0*2 1 +l«2 0 . The binary representation 
of two consecutive integers nearly always has the same leftmost 
bit; the only exception is the address 011-1 that has 10...0 as_a 
successor. Therefore^ it can be assumed, with only a very small 

25 probability of being wrong, that the leftmost bit of the difference 
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of two consecutive address words A(i): = a(i)©a(i+l) equals zero. 
More generally, it will be shown that it is very likely that A(i) 
starts with many zeros. In other words, a solution as proposed in 
the above mentioned European patent applications EP 01-201-841.2 
5 and EP 01203147 . 2— . of using a priori known information in the 
decoder^ can be applied since it is known that a lot of the 
leftmost information bits of A(i) are (with a high probability) 
equal to zero. 

[0039] Let i be an integer between 0 and 2 b ~ 2 . Let j be the 
10 number of ones in which (a) i ends, and further 0 ^ j ^ b-1. One can 
write (a) i = sOlD , where s has length b-j-1, and lJ denotes a 
string of j ones. It can be easily gathered that a(i+l) = slOD , and 
A(i) = a(i+l)0a(i) = 0 b -j- 1 lj +1 . 

[0040] The following conclusions can be drawn: 
15 a) for each ie{0, 1,..., 2 b -2}, A(i) is of the form 0 b " m l m for some 
m€{l, 2, ...,b}. 

b) A(i) = 0 b ~ m l m if and only if a(i) ends in (m-1) ones. 
[0041] From conclusion b)^_ it follows that the number of 
integers i for which A(i) starts with b-m zeros and ends in m ones 
20 equals 2b-m. Stated differently, for n£l, the fraction of integers 
ie{0,l, 2 b -2} for which A(i) ends in exactly m ones equals 2 b ~ 
m /(2 b -l)*(^) m . 

[0042] For example, the fraction of integers i for which A(i) 
ends in^ at most_j_ 4 ones approximately equals 1/2+1/4 + 1/8+1/16 = 
25 15/16 = 0.9375. That is, if it is assumed that A(i) starts with b-4 
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zeros, one is correct in nearly 94% of the cases. If it is assumed 
that A(i) starts with b-8 zeros, the minimum Hamming distance drops 
when the idea of using a prior known information symbols in the 
decoder, but the assumption is correct with a much larger 
5 probability of 255/256 * 0.9961. 

[0043] After decoding a corrupted version of (A(i), m 1 ©m 2 )G, 
i.e.^ after decoding the difference Di2/ i fc should be checked if 
the purported value for A(i) is of the form 0 n " m l m for some m^l. 
If not, the decoding result should be rejected. This extra check 

10 greatly reduces the probability of incorrect decoding, e.g.j_ 
copyright information. According to another special case all 
sectors have the same information word m. The difference between 
the code words for the sectors i and i+1 can be computed as follows 
c(i, m)©c(i+l, m) = (A(i), 0)G. 

15 [0044] In other words, the difference of any two consecutive 
code words is in the code C^, defined as 
C A ={(A(i), 0)G|0^i<2 b -2} . 

[0045] The difference of corrupted versions of two consecutive 
code words can therefore be decoded to the code C^. The code C A , 
20 which is a subcode of the main code C, has small cardinality if the 
set of difference vectors {A (i) | 0<i^2 b -2} has small cardinality, 
and in that case, its minimum distance may well exceed the minimum 
distance of the code C. 

[0046] In the special case that the address word a(i) is the 
25 conventional binary representation of i, it holds 
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C A ={(0i, 0 k " b )G|0<i^b-l}. 

Consequently, the subcode C A only contains b words, but not 2 b -l 
words, and optimal decoding can easily be performed by comparing 
the difference of two consecutive possibly mutilated words r with 
5 all b words from the subcode C A . It should be noted that the number 
of comparisons to be made is linear, not exponential, in b. 
[0047] Since A(i) does not u sually not end in many ones, it 
might be considered to decode to an even smaller subcode, namely 

CT A = {(0i, l b -i, 0 k " b )G | i < b-l} 

10 where b" is an integer between 0 and b-l. The larger b", the 
smaller C" A , but also the smaller the likelihood of correct 
decoding . 

[0048] Also in case that the address representation a 
corresponds to binary Gray encoding, the subcode C A only has b 

15 elements. This is because, by definition, binary Gray encoding 

means that two consecutive addresses only differ in one position, 
that is, for each i, A(i) consists of one 1 and b-l zeros. 
[0049] The present invention constitutes an effective and 
reliable method for retrieving information stored in code words 

20 situated in several consecutive sectors or transmitted subsequently 
in a data stream. It employs certain relationships between 
consecutive code words and jointly decodes several such consecutive 
code words. The present solution can be applied in any encoding and 
decoding system where address words having a known relationship are 

25 assigned to consecutive code words. 
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ABSTRACT^ OF THE DISCLOSURE 

Tho invention rolatco to a ln a method of decoding 
possibly mutilated code words (r) of a code (C) , wherein an 
5 information word (m) and an address word (a) are encoded into a 
code word (c) of said code (C) using a generator matrix (G) and 
wherein said address words (a) are selected such that address words 
(a) having a known relationship are assigned to consecutive code 
words (c) . To , to provide a reliable way of decoding making use of 
10 the known relationship, ar -the m ethod comprioinq includes fe he 
following otopo is propoocd ; 

decoding the differences (D) of a number (L-l) of pairs 

of possibly mutilated code words (r^, r-[ +1 ) to obtain estimates 
(u, v) for the differences of the corresponding pairs of code words 
15 (c-l, Ci +1 ) , 

combining paid the estimates (u, v) to obtain a number 

(L) of at least two corrupted versions (wj) of a particular code 
word (c) , 

forming a code vector (z) from said the number (L) of 

20 corrupted versions (wj) of paid the p articular code word (c) in 
each coordinate, 

decoding paid the code vector (z) to a decoded code word 

(c') in paid the code (C) , and 
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_using oaid the generator matrix (G) to obtain the 

information word (m) and the address word (a) embedded in said the 
decoded code word (C) • 

5 (Fig. 4) 
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